CDS

Accession Number TCMCG019C13118
gbkey CDS
Protein Id XP_022941908.1
Location complement(join(4220651..4221160,4221590..4222504))
Gene LOC111447126
GeneID 111447126
Organism Cucurbita moschata

Protein

Length 474aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA418582
db_source XM_023086140.1
Definition anthocyanidin 3-O-glucosyltransferase 5-like isoform X1 [Cucurbita moschata]

EGGNOG-MAPPER Annotation

COG_category CG
Description Belongs to the UDP-glycosyltransferase family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R02594        [VIEW IN KEGG]
R03605        [VIEW IN KEGG]
R04005        [VIEW IN KEGG]
KEGG_rclass RC00005        [VIEW IN KEGG]
RC00171        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01003        [VIEW IN KEGG]
KEGG_ko ko:K12356        [VIEW IN KEGG]
EC 2.4.1.111        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00940        [VIEW IN KEGG]
map00940        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGACTCCCCAACCCACGTCGCTCTCATCTCAAGCCCCGGGATGGGCCACCTCTTCCCCTCTCTCGAGCTCGCCACGCGACTCTCCACGCGCCACCACCTCACCCTCACTGTTTTCCTCGTCACCTCCCACTCCTCCTCCGCCGAAAATAACGTCGTTGCCGCCGCCGAGGCCACTGGCCTCTTTACTGTCGTCGAACTCCCACCGGCTGACATGTCCGACGTCACCGATTCCACTGTCGTTGGCCGCCTTGCCATCACCATGCGCCGCCACGTCCCGGCTCTCCGCTCGGCCATCTCTGCTCTCACCTCCCGCCCCTCCGCCCTCATTGCAGACATCTTCTCCACCGAGGCCTTTGCCGTCGCCGACGAGTTCCACATGGCCAAATACGTCTTCGTCGCCTCTAATGCATGGTTTTTAGCCTTGACCATTTACGCCCAGGTTCTCGACAAGCAAATCGTCGGGCAGTACGTGGACCAGAAAGAACCGCTTCAAATCCCTGGATGCGAACCGGTTCGTCCATGTGACGTCGTAGACCCGATGCTGGACCGGACCGAATCCCAGTATTACGAGTACGTCAAAATGGGGAGGGCAATAGCGTCGAGCCACGGCGTTTTGGTTAACTCGTGGGATGAGTTGCAAGGTCGCACACTCGCATCGTTCAAAGATCGGAGTCTGTTGGGTCGAGTAATGAACGCGCCGGTTTACTCGATCGGACCGATCGTGCGACATTTCGGCTCTGGGAAAGACGGCTCGAGCGAGCTGTTCAACTGGTTGAGGAAGCAGCCCGGGAAGTCGGTGATTTACGTGTCGTTCGGGAGCGGCGGAACGTTGTCGTTTGAGCAAATGACGGAAATGGCTCATGGCTTGGAGTTGAGTCGGCAGAGATTTGTTTGGGTGGTCCGGCCGCCCACGGTGAGGTCGGATGCGATGTTTTTCACGACAGGGGATGGGAGTGAGGACCAATCAGAGGCGAGATATTTGCCGGAGGGGTTTTTGGAGCGGACTAGCGAGGTGGGGTTTCTGGTGTCGATGTGGGCGGAGCAAACGGCGGTGCTGGGGAGTCCGGCAGTGGGGGGATTTTTCACGCACGGCGGATGGAACTCATCATTGGAAGGAATTACGAAGGGAGTTCCGATGATAGTGTGGCCGTTGTACGCGGAGCAGAGGATGAACGCCACGATGCTGGCGGATGAGATGGGGGTAGCGGTGCGGCCGAAGGAGCTGCCAGGGAATGCGGTGATCGGGAGGGAGGAGATCGCGGCGATGGTGAGGAAGATAATGGCGGAGGAGGACGAAGAAGGGAGAGCCATAAGAGCGAAGGCGATGGAACTTCAACGAAGTGCAGAAAAGGCCTGTGCGCAAGGAGGCTCGTCGTACGAGAACTTTGCTCGAGTTGTGAAACTTTTTGGCCGTACGGGATAA
Protein:  
MDSPTHVALISSPGMGHLFPSLELATRLSTRHHLTLTVFLVTSHSSSAENNVVAAAEATGLFTVVELPPADMSDVTDSTVVGRLAITMRRHVPALRSAISALTSRPSALIADIFSTEAFAVADEFHMAKYVFVASNAWFLALTIYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDPMLDRTESQYYEYVKMGRAIASSHGVLVNSWDELQGRTLASFKDRSLLGRVMNAPVYSIGPIVRHFGSGKDGSSELFNWLRKQPGKSVIYVSFGSGGTLSFEQMTEMAHGLELSRQRFVWVVRPPTVRSDAMFFTTGDGSEDQSEARYLPEGFLERTSEVGFLVSMWAEQTAVLGSPAVGGFFTHGGWNSSLEGITKGVPMIVWPLYAEQRMNATMLADEMGVAVRPKELPGNAVIGREEIAAMVRKIMAEEDEEGRAIRAKAMELQRSAEKACAQGGSSYENFARVVKLFGRTG